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SOME  LOCALLY  OPTIMAL  SUBSET  SELECTION  RULES* 

By  Deng-Yuan  Huang  and  S.  Panchapakesan 
Academia  Sinica,  Taipei  and  Southern  Illinois  University 

1.  Introduction.  Let  . be  k  +  1  independent  populations 

where  has  the  associated  density  function  f(x,e^)  with  the  unknown  para¬ 
meter  belonging  to  an  interval  0  of  the  real  line.  In  this  paper,  we 

are  concerned  with  two  types  of  goals.  The  first  is  to  select  from 

those  populations,  if  any,  that  are  better  (to  be  defined)  than  nQ  which 
is  the  control  or  standard  population.  The  other  is  to  select  a  subset 
(preferably  small  in  size)  of  the  k  populations  it,  that  will  contain 

the  best  (suitably  defined)  among  them.  In  the  recent  years,  the  attention 
has  increasingly  been  focussed  on  the  construction  of  optimal  selection 
rules.  Some  of  the  important  papers  in  this  direction  are  Bickel  and  Yahav 
[1],  Chernoff  and  Yahav  [2],  Goel  and  Rubin  [4],  Gupta  and  Hsu  [5],  Gupta 
and  Huang  [6,7],  and  Gupta  and  Kim  [10].  These  investigations  deal  with 
the  symmetric  case  which  implies  equal  sample  sizes.  There  have  been  some 
investigations  in  the  unequal  sample  sizes  case  but  these  relate  to  ad  hoc 
and  heuristic  procedures  and  are  not  generally  successful  in  establishing 
the  least  favorable  configuration  (LFC)  for  the  probability  of  a  correct 
decision.  For  many  classical  procedures  in  the  literature  for  selecting 
a  subset  containing  the  best,  the  LFC  turns  out  to  be  e,  =  ...  =  0^.  This 
provides  the  motivation  for  seeking  selection  rules  which  are  optimal  in  a 
suitable  sense  in  a  neighborhood  of  every  equi-parameter  point.  When  com¬ 
paring  these  populations  with  n  ,  the  local  optimality  concerns  with  the 

*This  research  was  supported  by  the  Office  of  Naval  Research  Contract 
N00014-75-C-0455  at  Purdue  University.  Reproduction  in  whole  or  in  part  is 
permitted  for  any  purpose  of  the  United  States  Government. 
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ability  of  the  rule  to  reject  any  population  that  is  inferior  to  nQ,  and 
to  choose  a  population  which  is  superior  while  all  others  are  inferior. 

Section  2  deals  with  selecting  populations  that  are  superior  to  a  con¬ 
trol  and  as  the  applications  indicate  we  can  handle  unequal  sample  sizes. 

The  next  section  discusses  the  usual  goal  of  selecting  a  subset  containing 
the  best  and  locally  optimal  rules  based  on  ranks.  Though  the  density  func¬ 
tion  f(x,e)  is  assumed  to  be  known  but  for  the  value  of  e,  a  rule  based  on 
ranks  is  justified  on  the  basis  of  usual  robustness  considerations  in  that 
the  ranks  are  insensitive  to  outliers  and  there  could  be  possible  deviations 
from  the  model . 

For  detailed  discussions  on  multiple  decision  problems  and  general  de¬ 
cision  theoretic  approach,  the  reader  is  referred  to  Gupta  and  Huang  [8] 
and  Gupta  and  Panchapakesan  [12]. 

2.  Comparison  with  a  control.  We  want  to  construct  a  rule  to  select 
all  populations  that  are  better  than  the  control.  Let  { X .  - } ,  j  =  l,...,n., 

1  J  1 

denote  the  sample  observations  from  i  =  0,1, ...,k.  The  selection  rule 

will  depend  upon  the  observations  through  the  statistics  T^Q,  i  =  l,...,k, 

where  T..  is  suitably  defined  to  indicate  the  difference  between  u-  and 

'  J  *  J 

For  fixed  n  ,  ...,n.,  we  assume  that  T. .  has  a  density  function  g  (t..)  de- 

o  K  1J  lj 

pending  on  the  parameter  t.^.  Usually  the  T. .  are  invariantly  sufficient 

t  J  I  J 

for  t. ..  We  use  the  parameter  t . •  as  our  measure  of  'distance'  between  n. 

'  J  '  J  1 

and  Ttj .  Now,  is  the  same  for  all  i  and  this  common  value  is  denoted  by 
t*.  In  the  case  of  a  location  parameter  e,  we  may  have  t..  =  | e . -  e . |  and 

i  J  i  J 

in  this  case,  t*  =  0.  On  the  other  hand,  if  e  is  a  scale  parameter,  we  can 

/ei  9.  \ 

define  =  max  I  —  ,  1 ,  which  gives  t*  =  1.  We  define  population  it.. 

J  i 

to  be  superior  if  >  t*  and  inferior  otherwise. 
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Let  a  -  { x  | t  -  (t-j0 


’""’Tko^  ’  “io  ^  - 1 T  i  o  —  ■  '  "io 


<  T*}  and  n*  = 


{t  I  Tj0  =  T*<  Tio»  j=l ....  In  other  words,  i2iQ  is  the  set  of  configura¬ 

tions  for  which  ir^  is  inferior  and  is  the  set  of  configurations  for 
which  tt ^  is  superior  and  all  other  populations  are  ‘just  inferior’. 

2.1.  Optimality  requirement.  Let  0  <  y  <  1  be  given.  We  wish  to 
derive  a  rule  which  satisfies 

(2.1)  sup  P  [it-  is  selected]  <  y  for  i  =  l,...,k,  and 

“io  1  1 


(2.2) 


maximi 


izes  V  -t* —  p  [it  is  selected)*!* ] 

1=1  9Tio  -  1  10 


Tio_T 


among  all  rules  satisfying  (2.1).  We  note  that  the  condition  (2.1)  con¬ 
trols  the  error  probabilities  and  the  condition  (2.2)  maximizes  the  ef¬ 
ficiency  in  a  certain  sense  of  the  rule  in  picking  out  the  superior  popu¬ 
lations  in  a  neighborhood  of  the  configuration  t  =  (t*,--.»t*). 

Let  hT(t-| ,. . .  ,tk)  denote  the  joint  density  of  Tj0»-..»Tko  with  respect 
to  a  a-finite  measure  y.  A  selection  rule  is  given  by  fl(t) 

=  {61 (t),. . . ,6k(t)},  where  fi^t)  is  the  probability  of  selecting  ^  when 
t  =  (t, ,...,tj  is  observed.  Further,  let  h^*(t)  denote  the  density  h^(t) 
when  t  =  (t*,...,t*),  and  let  hj*^(t)  denote  the  partial  derivative 

h  (t)  evaluated  at  r  =  (t*,...,t*).  Finally,  we  need  to  assume  cer- 
9Tio  T-  ' 

tain  regularity  conditions,  namely,  that  hT(t)  is  continuously  differenti¬ 


able  with  respect  to  each  component  of  t  and 


h  (t)  is  integrable. 
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Under  these  regularity  conditions,  it  is  easy  to  see  that 


^  selected |n|Q] 


*io=T* 


=  /  6i(t)h<:)(t)dp(t) 


where  l  is  the  sample  space  of  (T,  ,. . .  .T^). 
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Summarizing  the  above  discussion,  we  are  seeking  a  rule  <s(t)  which 
satisfies 

(2.3)  /  61-(t)hT(t)du(t)  <  y  for  r  €«i0»  1  =  1 » —  ,k , 

and  maximizes,  among  all  rules  satisfying  (2.3),  the  expression 

k 

(2.4)  l  J  6.(t)hJV(t)dy(t). 

i=l  J  1  '  T 

2.2.  A  locally  optimal  rule.  We  now  state  and  prove  the  main  theorem 
of  this  section. 

Theorem  2.1.  Under  all  the  assumptions  stated  previously,  a  rule 
6°(t)  which  satisfies  (2.3)  and  maximizes  (2.4)  among  all  rules  satsifying 
(2.3)  is  given  by 

1  > 

(2.5)  «°(t)  =  x  if  hjp(t)  =  cihT*(t) 

0  < 

where  0  <  x  <  1  and  ci  are  determined  by 

(2.6)  /  6*?(t)h  *(t)dy(t)  =  y»  i  =  l,...,k. 

J  T 

Proof.  The  proof  is  straightforward  by  noting  that  for  any  rule  6 
satisfying  (2.3)  we  have 

k  ... 

(2. 7)  J  [«1(t)-6?(t)][hJiJ(t)-C.hT*(t)]dp(t)  <0. 

2.3  Applications  to  special  cases.  We  consider  the  following  cases: 

(A)  normal  means  comparison;  corrmon  known  variance  and  unequal  sample  sizes, 

(B)  normal  means  comparison;  common  unknown  variance  and  unequal  sample 
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sizes,  and  (C)  gamma  scale  parameters  comparison;  unequal  degrees  of  free¬ 
dom. 


Case  A:  Normal  means  comparison;  common  known  variance  and  unequal 

O 

sample  sizes.  Let  be  N(e^,o  )  and  5(^  be  the  sample  mean  based  on  ni 
independent  observations  from  ■*.. ,  i  =  0,1,..., k.  We  take  Tio  =  -  XQ 

and  =  0,-  -  0_.  Here  t*  =  0.  The  joint  density  h  (t,,...,t.)  of 
T-Iq,...,!^  is  that  of  a  multivariate  normal  distribution  with  mean  vector 
r  and  covariance  matrix  z  given  by 


where 


nQ/n^ ,  i  =  l,...,k.  It  is  easy  to  verify  that 


In  the  particular  case  in  which  nQ  t  n^  =  ...  =  n^,  we  have  =  ... 
=  =  a  (say).  In  this  case,  it  is  easy  to  evaluate  .  The  rule  6° 
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PT*  {Zi  -  -n?  I  ziici}  =  Y. 

T  1  k+a  j=l  J  1 


being  normal  variables  with  E(Z^)  =  0,  V  ( Z  ^ (1  +  a  ),  and 

2  0 
E(Z.Zj)  =  .  The  left  hand  side  of  (2.9)  is  equal  to 

P  *{(k+«2-l)Z.  -  I  Z.  >  Cl.(k+a2)}  =  1  -  <t(c.(k+«2)/n), 
j?  i  J 

where  $(•)  is  the  standard  normal  cdf  and 
2 

n2  =  [(1  +  cx2)(k  +  a2-  1)2+  (1  +  a2)(k-  1)  -  2(k-  l)(k+  1  -  a2)+  (k-  l)(k-  2)]. 


Thus, 


(2.10) 


Jl_ 


♦  (1  -  Y  )  * 


Remark  2.1.  The  rule  6°  given  by  (2.8)  is  of  the  form:  select 

if  and  only  if  t •  _>  a  ttV  T  t.  -  c,  where  c  >  0  and  a  <  1.  If,  however, 

jfi  J 

we  take  a  =  1,  then  we  have  the  so-called  average-type  rule. 


Case  B:  Normal  means  comparison;  common  unknown  variance  and  unequal 


sample  sizes.  As  in  Case  A,  X.  is  the  sample  mean  based  on  n^  observations 

2  2 
from  tt .j .  Let  s  be  the  usual  pooled  unbiased  estimator  of  a  based  on 

v  =  (k+l)(n-l)  degrees  of  freedom.  Define  Z^  =  (5(.  -  X  )/  /-J-+  yj-  and 

/  ni  no 

Tio  =  ZiZs»  1  =  l».**»k.  Let  ft  =  (p^j)  where  =  Corr(Zi,Zj).  Then 
2  2  -1  2 

*  [(1  +  )(1  +  aj)]  2  where  =  nQ/n..  Let  A  =  (a^)  be  the  inverse 
of  ft.  Then  the  joint  density  h  (t-j » . . .  ,tk )  of  the  Tifl  is  the  multivariate 
t-density  (see  Dunnett  and  Sobel  [3])  given  by 


(2.11)  hT(tr...,tk)  = 


n,  1  ya  (t  Wt  n-i(v+k) 

(vn)k/2r(f)  v  i J  ij  1  Ti0  j  Tj0  3 


where  Tio  =  ei  -  0O»  i  =  1» —  ,k. 
Now,  it  is  easy  to  verify  that 
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T 


a  •  ■ t .+  Y  a . -t . 
y+k  11  1  jl  ^  J 

V  1+  ~  l  a.,t.t. 
v  1J  i  J 


=  t(t;v,k,{a^ j }) ,  say. 


Thus  the  rule  6°  is  given  by 

II  if  t(t;v,k,{ai .})  >_  c 

0  otherwise. 

In  the  particular  case  in  which  nQ  f  n^  =  n^  =  ...  =  n^,  we  get 
2 

p.j j  =  1/(1  + a  );  where  g-|  =  ...  =  ak  =  g.  This  gives 

i  =  j 

i  /  j  . 

The  constant  c^  is  given  by 


aij 


(l+g2)(k-l+g2) 

g2(k+g2) 


1  +a 


g(k+g2) 


(2.13)  (T-jqj -  >  >  jT^q jv *k, {a^ j })  >^ c •  ]  y- 

It  can  be  easily  seen  that  (2.13)  can  be  rewritten  as 

(2.14)  P[diY'AY- A^Y+vd^O]  =  y. 


where  d^  =  c^/(v  +  k).  A*  =  [A1  ...  Ak],and  Y1  =  (Y, . . .  Y.  )  has  the  multi¬ 
variate  t  density  given  by  (2.11)  with  t10  =  0,  i  =  l,...,k.  The  left 
hand  side  of  (2.14)  is  equal  to 

P[di  (Y  -  237  A'\  J1  A(Y  -  )  +  vd.  -  A’A^A.  <  0] 

2 

=  p[di “  A:  a’\  >*  A<?*  A:  A"lAi )  +  sZydi '  A:  AiA’lAi < 0] 

o 

where  Z  -  N(0,o  £2), 
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P[d  — — A-1A, )  ’  A(—  Z--^  — - —  A_1A.) 

1  0  '  0  2d. /v  1  0  '  a  2d, AT  1 


2  2 

+  d  .  yS_  I  a'A'^  <  01 
2  ai  2  4d.v  AiA  Ai-  J' 


O  1 


s^~  2 

In  order  to  evaluate  the  above  probability,  we  note  that  U  =  ~  x  and  that, 

o 

2 

conditioned  on  U  =  u,  the  quadratic  form  has  a  non-central  x  distri  ution 

on  k  degrees  of  freedom  with  the  noncentrality  parameter  x  =  x(u,d^) 

=  U  2  *A.j •  Thus  the  constant  c^  is  given  by 
4vd.j 


(2.15) 


^  P[di(Vi  +  u  -  x(u,di  )}_<  0]gv(u)du  =  y , 


where  has  a  noncentral  x  distribution  on  k  degrees  of  freedom  with  non- 

2 

centrality  parameter  x(u,d..),  and  g^(u)  is  the  central  x  density  on  v 
degrees  of  freedom.  Tables  for  the  constant  d^  are  expected  to  be  part  of 
a  future  paper. 

Case  C:  Gamma  scale  parameters  comparison;  unequal  shape  parameters. 
Let  it.,  (i  =  0,1, ...,k)  be  a  gamma  population  with  the  density 


(2.16) 


f(x;8.,v.)  =  - —  expl-x/e^ },  x  _>  0,  e  >  0. 

rlv,).”1 


We  take  =  0^/eo  so  that  t*  =  1 .  Let  X,,...,X^  be  independent  observa¬ 
tions  from  these  populations.  Define  l\o  =  X^/XQ,  i  =  l,...,k.  The  joint 
density  of  T^, . . .  ,T^0  is  easily  derived  to  be 


hT<i>  ■  rs 


k  v.-l 

n  t.1 
i=l  1 
k  t.l  m 


k  v.>  f  k  t." 

,."orN<K'»lr  t,  ^ 


(2.17) 
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where  m  =  vQ  +  ...  +  From  this,  we  get 


(2.18) 


h[i)(t) 

h  *(t)  ~  l+t,+...+t.  '  vi ‘ 


mt^ 


Thus  the  rule  6°  is  given  by 


(2.19) 


«®(t)  = 


1  mti  >.  (vi  +  0^(1  +  t1  +  ...  +  tk) 


0  otherwise. 


The  constant  is  given  by 
(2.20)  P 


mX. 


T*Uo+...+xk  -  Vi  +  Ci  )  =  Y 

where  X  ,...,Xk  are  independent  standardized  (e=l)  gamma  variables  with 
degrees  of  freedom  v0».*.*vk»  respectively.  It  is  known  that  X^/ (Xq+. . ,+X^ ) 
has  a  beta  distribution  with  parameters  and  m  -  ,  denoted  by 

B(v^,m-v1- )•  Thus  (v.j  +  c.j  )m~^  is  the  upper  100  y%  point  of  )  dis¬ 

tribution.  So  c.j  can  be  obtained  from  the  tables  of  the  incomplete  beta 
function. 

It  should  be  pointed  out  that  the  above  problem  includes  that  of  comparing 
normal  variances  based  on  unequal  sample  sizes. 


3.  A  locally  strongly  monotone  invariant  subset  selection  rule  based 
on  ranks.  In  this  section,  we  consider  the  goal  of  selecting  a  subset 
containing  the  best  population.  Subset  selection  procedures  based  on  ranks 
have  been  earlier  studied  by  several  authors  notably  among  whom  are  Gupta, 
Huang  and  Nagel  [9],  Gutpa  and  McDonald  [11],  McDonald  [13,14],  and  Nagel 
[15].  A  detailed  account  of  these  procedures  is  available  in  Gupta  and 
Panchapakesan  [12].  Of  these  authors,  Nagel  [15]  and  Gupta,  Huang  and 
Nagel  [9]  have  discussed  locally  optimal  selection  rules  but  using 
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different  criteria  from  ours.  The  setup  of  our  investigation  is  analogous 
to  that  of  Gupta,  Huang  and  Nagel  [9].  We  assume  a  parametric  model  but 
seek  procedures  based  on  ranks  for  robustness  reasons  as  explained  in 
Section  1. 

3.1.  Preliminaries.  Let  be  k  independent  populations  where 

7i .j  has  the  associated  distribution  function  F(x,o^)  assumed  to  be  absolutely 
continuous  with  density  f(x,e^)  which  is  known  except  for  the  value  of  the 
parameter  e^.  The  parameter  belongs  to  some  interval  on  the  real  line 
containing  the  origin.  It  is  further  assumed  that  the  density  f(x,e)  satis¬ 
fies  the  set  of  conditions  in  (3.3).  The  population  associated  with  the 
largest  is  called  the  best  population.  In  case  of  a  tie,  one  of  the  con¬ 

tenders  is  assumed  to  be  tagged  as  the  best.  Let  V,  ~  {§|e-  (e^  ...,e^)}  and 
=  {e|en  =  ...  =  Let  { X-  • } ,  j  =  1 , . . .  ,n ,  be  independent  observations 

from  71  ^  and  let  R^j  denote  the  rank  of  in  the  pooled  sample  of  N  =  kn 
observations.  The  smallest  observation  has  rank  1  and  the  largest  N. 

Definition  3. 1 .  A  rank  configuration  is  an  N-tuple  a  =  (Aj  , . . .  ,A^), 

A.j  £  {1 ,2,. . .  ,k},  where  a^  =  j  means  that  the  ith  smallest  observation  in 
the  pooled  sample  comes  from  77j. 

Let  c  =  {a}  dentoe  the  set  of  all  rank  configurations  for  fixed  k  and 
n.  For  fixed  a,  let  X  =  {x  £  2  |  ax  =  A } ,  where  X  -  {x:  x  =  (x-j , . . .  ,xN)} 
and  ax  denotes  the  rank  configuration  of  x  =  (x, . ,x^).  The  decision 
space  &  consists  of  all  2  subsets  (including  the  empty  set)  of  the  set 

{1,2 . k).  Any  subset  is  denoted  by  d  so  that  &  =  jd|dc{l,2,...,kl}. 

Any  decision  d  =  {ipi^. . . . , i r }  corresponds  to  selecting  the  subset  of  the 
k  populations  comprising  77.  ,...,77.  .  Let  6 (a ,d)  denote  the  probability 

71  V 
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that  the  decision  d  is  made  if  the  rank  configuration  a  is  observed. 

Definition  3.2.  A  rank  selection  rule  is  a  measurable  function  de¬ 
fined  on  c  x  &  such  that  for  each  A  i.  C  ,  (i )  6  (A,d)  >.  0  and 

(ii)  l  <S(A,d)  =  1. 
d  £ 

Let  pi (a),  i  =  l,...,k,  denote  the  individual  selection  probabilities 
of  the  k  populations,  where 

(3.1)  p,(A)  =  l  <5  (a  ,d) 

1  d  E>  i 

the  summation  being  over  all  subsets  containing  i.  Let  P  (,»=l,...,k)  de- 

a 

note  the  probability  of  including  the  population  it  in  the  selected  subset; 

a 

in  other  words,  P  =  E  Tp  (a)1,  a  =  l,...,k.  Any  decision  d  that  corres- 

a  0  a 

ponds  to  the  selection  of  the  best  population  is  called  a  correct  decision. 
We  assume  that  is  the  best  population  and  denote  the  probability  of  a 
correct  decision  by  P(CD|«,a).  Clearly,  P (CD 1 6 ,a )  =  E0 [pk(A ) ] . 

Definition  3.3.  A  selection  rule  is  strongly  monotone  if,  for  each 
i  =  l,...,k,P.  increases  in  when  all  other  components  of  e  are  fixed, 
and  P.  decreases  in  e-,  j  f  i,  when  all  other  components  of  e  are  fixed. 

■  J 

Let  6  denote  the  group  of  permutations  g  of  the  integers  (l,2,...,k). 
We  write  g(l,...,k)  =  (gl,...,gk).  Let  h  denote  the  inverse  of  g  and  de¬ 
fine  g(xj . x^)  =  (xhl . x^)  and  gd  =  { i  | hi  €d],  d£i  .  Also,  for  any 

A  6  C  ,  iet  g  be  defined  by  gA  =  (gA-| ,. . .  ,gAN).  Thus  g  is  induced  by  g. 

Let  5  be  the  group  { g } .  Finally,  let  G(i,j)  =  {geG|gi  =  j}. 

Definition  3.4.  A  selection  rule  5  is  invariant  under  permutation  if 
and  only  if  A(gA,gd)  =  *(A,d)  for  all  ACC,  dtjk,  gcG  and  gcG. 

An  invariant  selection  rule  is  completely  specified  by  any  one  of  its 
individual  selection  probabilities.  We  wish  to  derive  an  invariant  rank 


i 


k 
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selection  rule  6  such  that 

(3.2)  inf  P  (CD | £ , A )  =  P* 

8«0  2 

where  P*(  -t-<P*<  1)  is  specified  in  advance  and  it  is  locally  optimal  in 
2k 

the  sense  that  it  is  strongly  monotone  in  a  neighborhood  of  any  point  e0  in 
Obviously,  the  infimum  of  the  probability  of  a  correct  decision  over 
that  neighborhood  is  attained  at  eQ. 


3.2.  Derivation  of  a  locally  optimal  rule.  We  assume  that  the  den¬ 
sity  f(x,e)  satisfies  the  following  set  of  conditions: 

(i)  f(x,e)  is  absolutely  continuous  in  0  for  almost  every  x; 

(ii )  the  limit 

(3.3)  f (x,0)  =  1 im  i  [f(x,e)  -  f(x,0)] 

0->O 

exists  for  almost  every  x; 


(iii)  lim  /  |f(x,0)|dx  =  /  |f(x,0)|dx  <  =°  where  f(x,o)  =  f(x,0). 

0-+O  -00  —  to  39 

Now,  we  note  that  the  distribution  of  the  ranks  does  not  depend  upon 
the  underlying  distributions  of  the  X..  when  and  thus  P  (CD 1 6 , A )  is 

1 J  “0  0 

constant  in  Hence  we  can  satisfy  (3.2)  by  choosing  any  point  eQ  in 
and  satisfying  P0  (CD | 6 , A )  =  P*.  With  no  loss  of  generality,  we  assume 


that  eQ  =  (0,...,0). 

The  probability  that  the  rank  configuration  a  is  observed  under  0  with 
ei  ?  0,  i  =  l,...,k,  is  given  by  (see  Gupta,  Huang  and  Nagel  [9]) 


k 

VA)  =  Ao  +  X  eiAi(A’2) 


(3.4) 


where 


XN  X2  N 


AQ  =  /  /  ...  /  11  f(xi  ,0)dxr .  .dxN, 

“CO  -U)  -CO  1=1 


-  XN  *2  f(x,,e,)-f(x,,0) 

A,(fl,o)  =  l  I  I  ...  I  J  V  J- 

1  —JO  —CO  —  oo  ~\ 


j-1  N 

n  f(x  ,0)  n  f(xe,o  )dx^...dxN,  i  = 
e=l  e=j+l  Ae 


Here  we  define  n  f(x. ,0)  ‘  1. 
i  =  l  1 

The  following  lemma  summarizes  some  results  of  Gupta,  Huang  and  Nagel 
[9]  for  an  invariant  rule. 


Lemma  3.1.  For  an  invariant  rank  selection  rule. 


(3.5)  P0(CD|6,a)  =  E0PkU) 

There  exists  an  t  >  0  such  that  for  all  o  with  0  j o ^  |  <  t  ,  i  =  1 . 

k,  A. (A,e)  is  approximately  equal  to  A. (a)  l  B-,  where 

j:A>i  J 


co  XN  X2  •  N 

3j  =  /  /  ...  /  f  (x .  ,0)  11  f(xe,0)dxr 

—  r~tl  —  <o  —CO  “  e=l 

e^j 


In  this  case. 


(3.6)  P ( CD 1 6 ,a)  ~  l  Pk(A)[A  +  7F-TT  {(U‘0k)V  +  (kok-U)Ak(A)  1], 
k  k  nk 

where  U  =  j  o.  and  V  =  j  A. (a)  =  )  8. ,  independent  of  A. 


Now,  for  deriving  a  locally  strongly  monotone  selection  procedure. 


consider 


(3.7) 


P 

a 


I 

A  €  C 


[Ao  +  TFTT 


{ (U -  o  )V+  (ke  -  U)A  (a) }]p  (a). 

a  a  a  a 


For  any  a  =  l,...,k,  we  get 


(3.8) 


if  *  l  \^KM 

a  A  £  C 


3  Pa 


1 


Sep  TFTT{V"A«(A)}pa(A)’  6^a- 


For  the  strong  monotonicity  property,  we  need  to  define  PaU)  such  that 

l  A  (a)p  (a)  >  0  and  £  {V-A  ( a ) }p  (a)  <  0.  This  is  obviously  accomp- 
A  £  C  a  a  A€Caa 

lished  by  defining 


(3.9) 


Pa(A)  = 


1, 
P  » 
0, 


Aa(A)  >  V  +  D 
A  (a)  =  V  +  D 

a 

A  (a)  <  V  +  D 

a 


where  D  and  p  are  to  be  chosen  such  that 

(3.10)  V  +  D  >  0  and  PQ  [Aa(A)>V+D]  +  pP0  [AaU)  =  V+D]  =  P*. 

-o  -o 

Summarizing  the  above  discussion,  we  obtain  the  following  theorem. 


Theorem  3.1.  Let  f(x,e)  satisfy  the  conditions  in  (3.3).  Then  the 
invariant  subset  selection  rule  based  on  ranks  defined  by  (3.9)  is  strongly 
monotone  in  a  neighborhood  of  any  point  e  € n  and  inf  P  (CD) A ,o )  =  P* 

e*»o  * 

provided  constants  P  and  D  satisfying  (3.10)  exist. 

Remarks.  (1)  It  is  possible  that  D  and  p  satisfying  (3.10)  may  not 
exist.  In  such  a  case,  the  rule  selects  the  empty  subset.  (2)  When  D  and 


L 


p  satisfying  (3.10)  exist,  the  rule  (3.9)  is  also  locally  optimal  in  the 
sense  it  has  the  maximum  p0(CD|6,  a) in  a  neighborhood  of  oq  among  all  in¬ 
variant  subset  selection  rules  based  on  ranks  satisfying  inf  p  (CD|$,a) 

e-^o  - 

This  follows  from  the  theorem  of  Gupta,  Huang  and  Nagel  [9].  (3)  Rules  of 

the  type  (3.9)  are  shown  to  be  "just”  by  Nagel  [15]  provided  that  is  non¬ 
decreasing  in  i,  which  is  true  for  location  parameters  if  and  only  if  f(x) 
is  strongly  unimodal  (i.e.,  -log  f(x)  is  concave).  Nagel  [15]  has  also 

shwon  that  for  a  just  rule  inf  p  ( CD [ 6 , A )  =  inf  pfl(C0|6,A). 

een2  e  €  n0  - 


3.3.  A  special  case.  If  f(x,e)  is  the  logistic  density,  i.e., 
f(x,e)  =  e’^x-e V[1  +  e  ^X_Q^]^,  then  we  get  B.  *  r  +  is,  where  the  actual 
values  of  r  and  s  >  0  are  immaterial.  The  rule  becomes 


(3.11)  P  (A) 

a 

where  p  and  C  satisfy 


1 

p 

0 


> 


n 


< 


V  j,  vc,*pV  j, 

The  above  rule  is  a  randomized  version  of  the  so-called  type  rule  of 
Gupta  and  McDonald  [11]  who  proposed  it  on  an  ad  hoc  basis.  It  should 
also  be  pointed  out  that  the  rule  is  'just'  in  the  sense  of  Nagel  [15] 
which,  as  pointed  out  earlier,  implies  that  the  infimum  of  the  probability 
of  a  correct  decision  over  n  is  P*. 
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